Towards an Assessment of an AI System ' s Validity by a
نویسندگان
چکیده
Although there seems to be no (formal) way of proving the validity of an AI system, the authors present some ideas on developing a validity statement based on a Turing-test methodology with a set of "good" 1 test cases. The solution of these test cases will be rated by a panel of expert validators. The methodology is called the Turing-test, because a random process of distributing the test case with solutions to the diierent valida-tors ensures that no validator knows who the author of a test case solution is. The objective of this is, of course, to make the result of the validation process (the validity statement) more objective. Furthermore, in an eeort to maximize objectivity , the approach described here includes a competence scale for each validator. This is done for each test case separately due to the fact that competence is a property of an expert , which normally isn't distributed homogeneously within the "input space" of the AI system. The degree of competence is estimated by considering the experts' behavior while solving the test cases and rating the test case solutions. The main sources of this competence esti-1 "Good" in this context is the subject of other papers published by the authors and their group. mation are the experts' own competence assessment , their behavior while rating the own solution, and the rating of their solution by other experts.
منابع مشابه
Towards a Task-Based Assessment of Professional Competencies
Performance assessment is exceedingly considered a key concept in teacher education programs worldwide. Accordingly, in Iran, a national assessment system was proposed by Farhangian University to assess the professional competencies of its ELT graduates. The concerns regarding the validity and authenticity of traditional measures of teachers' competencies have motivated us to devise a localized...
متن کاملAn integrated Assessment System of Citizen Reaction towards Local Government Social Media Accounts
Agovernmentshouldusesocialmediaforcommunicatingwithitscitizen.Theengagement index score is one of the methods for assessing the rate of governmental success in using social media as a tool in establishing interactive relationships with its citizen. In general, the engagement index score is obtained by calculating the number of posts, number of likes and comments, and so forth on a single social...
متن کاملTowards an Assessment of an Ai System's Validity by a Turing Test
The authors present some ideas on developing a validity statement based on a Turing-test methodology with a set of "good" test cases. The objective of this is, of course, to make the result of the validation process (the validity statement) more objective. Furthermore, in an eeort to maximize objectivity, the approach described here includes a competence scale for each validator. This is done f...
متن کاملA Student Assessment System Framework
Background & Objective: One of the factors for achieving quality improvement of educational programs defined establishing student assessment system at universities. The aim of the study was to develop a framework for a student assessment system. Materials and Methods: The present study is an educational scholarship study that conducted in three phases. In the first phase, the reviewing the lit...
متن کاملDevelopment and validation of work permit system performance assessment questionnaire, a case study in an Iranian oil refinery
Background: Permit-to-work system is a process used to prevent accidents in the process industries. Evaluation and monitoring of the performance of a permit to work system reveal its inherent weaknesses and reduce accidents in process industries. Since there exists no local tool for monitoring the performance of permit-to-work system in refineries and process industries, such as petrochemicals,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007